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Abstract — This work investigates tlie effect of finite buffer sizes 
on tlie tlirougliput capacity and pacltet delay of line networks 
with packet erasure links that have perfect feedback. These 
performance measures are shown to be linked to the stationary 
distribution of an underlying irreducible Markov chain that 
models the system exactly. Using simple strategies, bounds on 
the throughput capacity are derived. The work then presents two 
iterative schemes to approximate the steady-state distribution of 
node occupancies by decoupling the chain to smaller queueing 
blocks. These approximate solutions are used to understand the 
effect of buffer sizes on throughput capacity and the distribution 
of packet delay. Using the exact modeling for line networks, it is 
shown that the throughput capacity is unaltered in the absence 
of hop-by-hop feedback provided packet-level network coding 
is allowed. Finally, using simulations, it is confirmed that the 
proposed framework yields accurate estimates of the throughput 
capacity and delay distribution and captures the vital trends and 
tradeoffs in these networks. 

Index Terms — Finite buffer, line network, Markov chain, net- 
work coding, packet delay, throughput capacity. 

I. Introduction 

In networks, packets have to be routed between nodes 
through a series of intermediate relay nodes. Each intermediate 
node in the network may receive packets via multiple data 
streams that are routed simultaneously from their source nodes 
to their respective destinations. In such conditions, packets 
may have to be stored at intermediate nodes for transmission 
at a later time. If buffers are unlimited, intermediate nodes 
need not have to reject or drop arriving packets. However, in 
practice, buffers are limited in size. Although a large buffer 
size is preferred to minimize packet drops, large buffers have 
an adverse effect on the latency, i.e., the delay experienced by 
packets stored in the network. Further, using larger buffer sizes 
at intermediate nodes would also result in secondary practical 
issues such as increased memory-access latency. Though our 
work is motivated by such concerns, our work is far from 
modeling realistic conditions. This work modestly aims at 
providing a theoretical framework to understand the funda- 
mental limits of single information flow in finite-buffer line 
networks and investigates the tradeoffs between throughput, 
packet delay and buffer size. 

The problem of computing capacitjQ and designing efficient 
coding schemes for lossy wired and wireless networks has 
been widely studied |1 1-|5 |. However, the study of capacity of 
networks with finite buffer sizes has been limited. This can be 
attributed solely to the fact that analysis of finite buffer systems 
are generally more challenging. With the advent of network 
coding as an elegant and effective tool for attaining optimum 
network performance, the interest in finite-buffer networks has 
increased |5|-|8|. 

'in this work, we use capacity to refer to the throughput capacity, i.e., the 
supremum of all rates of information flow achievable by any coding scheme. 



The problem of studying lossy networks with finite buffers 
has been investigated in the area of queueing theory under a 
different but similar framework. The queueing theory frame- 
work attempts to model packets in a network as customers, 
the delay due to packet loss over links as service times in 
the nodes, and the buffer size at intermediate nodes as the 
queue size. Further, the phenomenon of packet overflow in 
communications network is modeled by blocking (commonly 
known as type II or blocking after service) in queueing 
networks |9|. However, this packet-customer equivalence fails 
in general network topologies due to the following reason. 
When the communications network contains multiple disjoint 
paths from the source to the destination, the source node can 
choose to duplicate packets on multiple paths to minimize 
delay. This replicating strategy cannot be captured directly 
in the customer-server based queueing model. Therefore, 
the queueing framework cannot be directly applied to study 
packet traffic in general communications networks. However, 
queueing theory offers solid foundation for studying buffer 
occupancies and packet flow traffic in line networks. There 
has been extensive study in queueing theory literature on 
the behavior of open tandem queues, which are analogous to 
line networks ifTol -lfTS I. However, approaches from queueing 
theory literature predominantly consider a continuous-time 
model for arrival and departure of customers/packets. In this 
work, we consider a discrete-time model for packet arrival and 
departure processes by lumping time into epochs. This model 
is similar to those in lfT6l . iflTl . 

The broad contributions of this paper can be summarized as 
follows. The bulk of this work operates under the assumption 
of perfect hop-by-hop feedback. We present a Markov-chain 
based model for exact analysis of line networks. The capacity 
of a line network is shown to be related to the steady-state 
distribution of an underlying chain, whose state space grows 
exponentially in the number of hops in the network. Simple 
assumptions of renewalness of intermediate packet processes 
are employed to estimate the capacity of such networks. 
The estimates are exact for two-hop networks. However, the 
estimates extend the results of ifTTl to networks of any number 
of hops and buffer sizes of intermediate nodes. Using the esti- 
mates, the profile of packet delay is derived and studied. Using 
the exact Markov chain model in conjunction with network 
coding, it is shown that the throughput capacity is not affected 
by the absence of feedback in line networks. This result is 
similar to the information-theoretic result that feedback does 
not increase capacity of point-to-point channels 1 1 8 1 . Finally, 
simulations reveal that our estimates closely predict the trends 
and tradeoffs between hop-length, buffer size, latency, and 
throughput in these networks. 

This paper is organized as follows. First, we present the 
formal definition of the problem and the network model in 
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Section |ll] Next, we present our framework for analyzing 
capacity of finite-buffer line networks in Section |lll] The pro- 
posed Markovian framework is then employed to investigate 
packet delay in Section |IV] We compare our analytical results 
with simulations in Section [V] and conclude it with a brief 
discussion on the inter-dependence of buffer usage, capacity 
and delay. Finally, Section |VI] concludes the paper 

II. Network Model and Problem Statement 

This work focuses on the class of line networks. As il- 
lustrated in Fig. [T] h denotes the number of hops in the 
network, and V = {vo,vi, . . . ,Vh} and 1^ = {{vi,Vi+i) : 
i ~ 0, . . . , h — 1} to denote the set of nodes and the set of 
links in the network, respectively. Such a network has h — 1 
intermediate nodes, which are shown by black squares in the 
figure. Each intermediate node Vi is assumed to have a buffer 
of rrii packets. Note that buffer sizes of different nodes can 
be different. Without loss of generality, we assume h > 2 
and rrii > 0, for i = — 1. Further, it is assumed 

that the destination node has no buffer constraints and that 
the source node possesses an infinitude of innovative packets 
at all times. The system is analyzed using a discrete-time 
model, where each node can transmit at most one packet 
over a link in any epoch. Intermediate buffers are assumed 
to be empty at epoch I = and the dynamics for Z > are 
steered by the loss processes on the edges of the network. The 
loss process on each link is assumed to be memoryless and 
statistically independent of the loss processes on other links. 
We let Ei+i e (0, 1) to denote the erasure probability on the 
link for i = 0, . . . , h — 1. In this model, a node 

receives a packet on an incoming link when the neighboring 
upstream node transmits a packet and when the packet is not 
erased over the link. The reader is directed to Appendix |A] 
for a discussion on how the assumed discrete-time model 
relates to continuous-time exponential model that is commonly 
employed in queueing theory. 

For the bulk of this work, we assume that the network 
has a perfect hop-by-hop feedback mechanism indicating the 
transmitting node of the receipt and storage of the transmitted 
packet by the receiving node. However, a subsequent section 
of this paper drops this assumption to study the capacity of 
line networks without feedback. It is also assumed in this work 
that nodes operate in a transmit-first mode, i.e., each node first 
generates a packet (if it has a non-empty buffer) and transmits 
it on the outgoing edge. The node then processes the buffer 
after receiving the acknowledgement from the next-hop node 
before accepting/storing the packet on its incoming edg^ 

For notational convenience, the random process on the 
link {vi-i,Vi) is denoted by {Xi{l)}'s,^g. Xi{l) = 1 if and 
only if the packet transmitted at epoch / is deleted by the 
channel {vi-i,Vi), and Xi{l) = otherwise. For the sake 
of succinctness, we let £ = {ei, . . . ,eh) and buffer sizes 
M = (mi,...,mft_i). 

The focus of this paper is two-fold. The foremost aim is 
to identify the supremum of all rates that are achievable by 

^Note that the need for such an ordering arises due to the discrete nature 
of time assumed in this work. 
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Fig. L An illustration of the line networl:. 



the use of any coding strategy between the ends of a line 
network with erasure probabilities £ and buffer sizes M.. In 
the line network illustrated in Fig. [T] we first aim to identify 
the maximum rate of information that the node t^o can transmit 
to node Vh, which is denoted by C^^ {£ , M). The next issue 
on which we focus is the delay experienced by packets in 
intermediate node buffers when the network operates near the 
throughput capacity. 

In our analysis, we employ the following notations. Vectors 
will be denoted by boldface letters, eg., r, s. The indicator 
function for the set ]R>o ~ (0, oo) is represented by 
For any x € [0,1], x — 1 — x. The convolution operator is 
denoted by (g) and (X)'/ is used as a shorthand for the Z-fold 
convolution of / with itself. For < A < 1, G(A) denotes the 
probability mass function of a positive random variable that 
is geometric with mean For a discrete random variable 
Z with probability mass function fz, (Z) and (fz) are both 
used to denote the mean of the random variable Z. Lastly, for 
appropriate g e N, Fg denotes the Galois field of size q. 

III. Capacity of Line Networks 

In this section, we investigate the effect of finite buffers on 
the capacity of line networks. First, we present a framework for 
exact computation of the capacity of line networks that have 
perfect hop-by-hop feedback. We then present bounds on the 
capacity using techniques from queueing theory. Subsequently, 
we present our approaches to approximate the capacity of a 
line network. In the concluding subsection, we illustrate that 
the throughput capacity remains unaltered when feedback is 
absent provided packet-level network coding is allowed. 

A. Exact Computation of Capacity 

The problem of identifying capacity is related to the prob- 
lem of identifying schemes that are rate-optimal. In the pres- 
ence of lossless hop-by-hop feedback, the scheme performing 
the following steps in the given order is rate-optimal. 

1 . If the buffer of a node is not empty at an epoch, then it 
must transmit one of the stored packets at that time. 

2. A node deletes the packet transmitted at an epoch if it 
receives an acknowledgement of packet storage from the 
next-hop node at that epoch. 

3. After performing 1 and 2, a node accepts an arriving 
packet if it has space in its buffer and sends an acknowl- 
edgment of packet storage to the previous node. 

Notice that in the above scheme, at each epoch, the buffer 
of the last intermediate node is updated first, and the buffer 
of the first intermediate node is updated last. To determine 
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the throughput capacity of the network, we need to track 
the number of packets that each node possesses at every 
instant of time by using the rules of buffer update under the 
above optimal scheme. Let n(^) — {ni{l), . . . , nh~i{l)) be the 
vector whose component denotes the number of packets Vi 
possesses at time /. The variation of state at the epoch can 
be tracked using auxiliary random variables Yi{l) defined by 



■(7[n,_i(0]X,(0 
X,{l)a[m,~n,{l)+Y,+i{l)] 



i — h 

1 < i < h . 

i = 1 

(1) 



From the definition of the auxiliary binary random variables 
in ([T]i, we see that Yi{l) ~ 1 only if all the following three 
conditions are met; 

1. Node Vi^i has a packet to transmit to Vi. 

2. The link {vi^i,Vi) does not erase the packet at the 
epoch, i.e., Xi{l) — 1, and 

3. Node Vi is not full after its buffer update due to its 
transmission over {vi,Vi+i) at the epoch. 

The changes in the buffer states can then by seen to be given 
by the following. 

n,{l + l)=n,{l) + Y,il)-Y,+i{l), 1 < i < h. (2) 

Note that since Y{1) ^ {Yi{l), . . . ,Yh{l)) is a function 
of n(0 and X(0 = {Xi{l), . . . , XhH)), n(Z + 1) depends 
only on its previous state n(Z) and the channel conditions 
X(Z) at the epoch. Hence, {n(Z)}ig^>g forms a Markov 
chain. The number of states corresponds to the number of 
possible assignments to n{l), which amounts to Y[i=i (''^i^^) 
possibilities. However, since at each time instant the number 
of packets that can be transmitted over any link is bounded 
by unity, we see that for every i = 1, . . . , h ~ 1 and / G Z>o, 

y^(/)£{0, 1} and \ni{l + l)-rM{l)\<l. (3) 

Therefore, the number of non-zero entries in each row 
of the probability transition matrij(0 P{£,A4) represent- 
ing the transitions in the occupancy is bounded above by 
min(3"-i, n!=i'("^. + l))- 

A detailed categorization of the states S that enables further 
understanding can be performed thus. We can order the states 
of the chain in such a way that the state (si, . . . , Sh-i) £ S 
corresponds to the row index 1 + si + J2i=2 11}=! (™i + 1) 
in the matrix P{£,A4). Denote T, to be the set of states that 
have Sh-i = i for t = 0, . . . , mh-i- Let F^, fit, F+ represent 
the transition matrices for transitions from states in to those 
in T,-i, Tt, Tt+i, respectively. Then, it can be shown that 
F+ = F+, ^ n, and F,r = F" for t = 1, . . . , nih-i - 1 
(see Lemma [U. Therefore, the transition matrix of the chain 
can be structurally represented as follows. 

^The ij"" term of the matrix P{£, M) represents the probabiUty that the 
next state is j given that state is presently i. 



P{£,M) = 
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The dynamics given by the above equation can be depicted 
pictorially by the chain in Fig. |2] Note that due to the 
finite buffer condition and the non-negativity of occupancy, 
the transitions from the first block and from the last block 
differ from the transitions from the blocks between them. 
Further, the states within each Ti, i = 0, . . . , mh-i, can be 
organized into mh-2 + 1 sets in a similar fashion. In addition 
to this structural property, the transition sub-matrices satisfy 
the following algebraic properties. 
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Fig. 2. The Markov chain for the dynamics of occupancy in a line network. 



Lemma 1: In a generic line network, the following hold. 

a. F+ = F+, n, = ni, and Fr = Fj^ for i = 
l,...,m,i_i - 1. 

b. For h > 2, is non-singular and upper triangular for 
i = 1, . . . ,TO/i-i. 

c. For h > 2, is singular and lower triangular for i = 
0, . . . , mh-i - 1 . 

d. I ~ fti is non-singular i — 0, . . . , nih-i- 

Proof: See Appendix iBl ■ 
To illustrate the implications of the above lemma, con- 
sider the Markov chain for a three-hop line network with 
erasure probabilities £ = (ei,£2,e3), and with buffer sizes 
M = (2, 2) presented in Fig. [3] For this network, the algebraic 
properties of Lemma [T] can be understood as follows. 

1. Any transition involving a decrease in the second com- 
ponent involves a non-negative change in the magnitude 
of the first component. 

2. Any horizontal transition involving a decrease in the sec- 
ond component is always feasible (provided the second 
component of the starting state is positive). 

3. Any transition involving an increase in the second com- 
ponent involves a non-positive change in the magnitude 
of the first component. 

4. Not all horizontal transitions involving an increase in 
the second component are feasible. For example, the 
transitions from the state (0, 0) to (0, 1) and from the 
state (0, 1) to (0,2) are infeasible, and hence (F(j')ii — 

(r^)ii = o. 

While the first two facts relate to the upper triangular 
structure and non-singularity of F^, the latter two relate to the 
lower triangular and singularity properties of F+. This Markov 
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chain for the dynamics of the state of a line network with 
perfect feedback is irreducible, aperiodic, positive-recurrent, 
and ergodic |19|, 120|. By ergodicity, we can obtain temporal 
averages by statistical averages. Therefore, the throughput 
capacity C^^{£, A4) can be identified by appropriately scaling 
the likelihood of the event that the system is in a state wherein 
the last node buffer is non-empty. This quantity is given by 



C^^ {£, M) = Eh Pr[{s e S : > 0}] 



(4) 



Notice that packets are not erased from the buffers without 
a receipt of acknowledgement of storage from the next-hop 
node. Therefore, the packet-flow rate is conserved. Therefore, 
for < i < h — 1, the throughput capacity can also be 
identified from 



(5) 



Thus, the problem of identifying the capacity of line networks 
is reduced to the problem of computing the steady-state prob- 
abilities of the aforementioned Markov chain. However, due to 
the size of the Markov chain and its transition matrix, and the 
presence of multiple reflections due to the limited buffers at 
intermediate nodes, the problem of computing the steady-state 
distribution and capacity is computationally tedious even for 
networks of reasonable hop-lengths and buffer sizes. 

As the first step towards estimation, we can define a finite 
sequence of matrices by 



I 

r- 



i = 
i = 1 

I < i < rrih- 



Note that these matrices relate the steady-state distribution iiTi 
of the states in T^, i = 0, . . . , m/i_i by ttt^ = HiTTTg- Using 



these relations, we can, in theory, estimate the capacity by 

1 



C 



PF 



{£,M)<eh{l 



(6) 



However, this matrix-norm approach does not provide insight 
into occupancy statistics of various nodes. Therefore, we focus 
on an approximations -based approach to capacity estimation 
in the remainder of this work. 

B. Bounds on the Capacity of Line Networks 

In queueing theory, problems of identifying the steady-state 
probability of stochastic networks have often been dealt with 
approximations. Most approaches to problems in this area 
have been to approximate the dynamics of the network by 
focussing on local dynamics of the network around each node 
and the edges incident with it. The key idea in this section is to 
modify the exact Markov chain to derive bounds on throughput 
capacity. To do so, notice that the main reason for intractability 
of the exact system is the strong dependence of Yi{l) on not 
only Yi-i{l), but also Yi+i{l). This dependence translates to 
a strong dependence of ni{l) on both ni-i{l) and ni+i{l). 
Relaxation of this strong dependence will be a step towards 
possible decoupling of the system, and a deeper understanding 
of the tradeoffs in such networks. 

Consider a network operation mode where each intermedi- 
ate note transmits an acknowledgement whenever it receives 
a packet (as opposed to the rate-optimal setting where it 
sends an acknowledgement whenever it receives and stores 
a packet successfully). Under this new mode of operation, 
we notice that the dependence of the state of node on 
^ that of nodes further downstream is eliminated. This mode of 
operation is equivalent to assuming that a packet that arrives at 
a node whose buffer is full gets lost/dropped unlike the optimal 
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mode of operation where it gets re-serviced. In this mode, the 
state updates are given by a simplified Markov chain that is 
generated by the following rule for alH G Z>o and 1 < i < h. 

fi,{l + 1) - n,{l)+Y^{l)a[m^~n,{l) + Y^+i{l)]-Y^+i{l), (7) 

where 

1 i = 1 

To avoid confusion, we appellate the chain that is obtained 
by the dynamics defined by ([T]i and (|2]i as the Exact Markov 
Chain (EMC) and the one defined by ^ and ^ as the 
Approximate Markov Chain (AMC). Also, we allow n(Z) and 
n(/) to always denote the state of an instance of the process 
generated by the EMC and the AMC, respectively. Then, the 
following property holds. 

Theorem 1: (Temporal Boundedness Property of the AMC) 
Consider a line network with h hops and an instance of 
channel realizations {Xi{l) : i = 1, . . . , /ijig^^^. Suppose 
we track the variation of the states of the EMC and the AMC 
using this instance of channel realizations with the same initial 
state n(0) = n(0). Then, for any I S K>o and 1 <i <h — l, 
the following holds. 

nS) > n,{l). (9) 

Proof: The proof is detailed in Appendix |C] ■ 
The Temporal Boundedness Property guarantees that sta- 
tistically, the probability that a node has an empty buffer is 
overestimated by the AMC. In fact, if we can identify the 
steady-state distribution of the states of AMC, we can provide 
a lower bound for the steady-state probability of any subset 
of states A Q S that have the form 

^ = |s £ 5 : (sj > aj), j = 1, . . . - l|, (10) 

where < aj < irij for j = 1, . . . , h — 1. Using the Temporal 
Boundedness property in conjunction with (|4]l, we can provide 
a lower bound C^^ {£, A4) for the capacity of the line network 
by underestimating the probability in ^ by using the steady- 
state distribution of the AMC instead of that of the EMC. 
Equivalently, the capacity of the line network is at least that 
of the throughput achievable by the AMC. This above idea 
of lower bound extends easily to an upper bound using the 
following result. The fundamental idea behind the following 
bound is to manipulate the buffer sizes at each node so that 
the packet drop in the modified network is provably infrequent 
than in the actual network. 

Theorem 2: Let the operator E be defined by (&i, . . . , 6^) M> 
(&i, &i + 62, ■ • ■ , &i + • • ■ + bk). For a given network with 
distinct erasure probabilities £ and buffer sizes Ai, denote 

—PF 

C {£,M) to be the throughput computed from the steady- 
state distribution of the AMC defined by dTli and (IHll with era- 
sure probabilities £ and buffer sizes 
C^''{£,I.{M)). Then, 

C^''{£,M) <C^^{£.M) (11) 
Proof: A detailed proof is presented in Appendix |D] ■ 



Thus, the problem of bounding capacity is reduced to 
identifying the steady-state probability of the AMC. Notice 
that the above bounds are not in a computable form, since 
they still involve identifying the steady-state distribution of 
the AMC. Even though the AMC is significantly simpler 
than the EMC, the output process from each intermediate 
node is not renewal ifTSl . Therefore, the distribution of inter- 
departure times from each intermediate node is insufficient to 
completely describe the arrival process at intermediate nodes 
Vi for 1 < i < h. Therefore, a straightforward hop-by-hop 
analysis (without further assumptions) seems insufficient to 
identify the capacity of such networks. 

C. Iterative Estimation of the Capacity of Line Networks 

In this section, we present two iterative estimates for the 
capacity of line networks that is based on certain simplifying 
assumptions regarding the EMC. We notice that the difficulty 
of exactly identifying the steady-state probabilities of the EMC 
stems from the finite buffer condition that is assumed. The 
finite buffer condition introduces a strong dependency of state 
update at a node on the state of the node that is downstream. 
This effect is caused by blocking when the state of a node 
is forced to remain unchanged because the packet that it 
transmitted is successfully delivered to the next-hop node, but 
the latter is unable to store the packet due to lack of space 
in its buffer. Additionally, the non-tractability of the EMC 
is compounded by a non-renewal packet departure process 
from each intermediate node. In this section, we ignore some 
of these issues to develop iterative methods for estimation. 
Figure|4]encapsulates the assumptions made in both estimation 
approaches. While both approaches ignore the non-renewal 
nature of packet arrival process at each node, the first approach 
makes an additional memoryless assumption on the arrival 
process. Additionally, both approaches model the effect of 
blocking by the introduction of a single parameter ph that 
represents the probability that an arriving innovative packet 
will be blocked. 

1 ) Rate-based Iterative Estimate: This estimate makes the 
following assumptions to decouple the dynamics of the system 
and enable capacity estimation. 

Al. The packet departure process at each intermediate node is 
memoryless. In other words, each node Vi sees a packet 
arrival process that is memoryless with (average) rate 
Vi packets/epoch. This assumption allows us to track 
information rates over links while simplifying the higher 
order statistics. 

A2. Any packet that is transmitted unerased by the channel 
is blocked independently with a probability 
Pbi+i- That is, for any < fc < rrii, 

Pr[K,+i(.) =0,X,+i(.) = - fc] 

Here, Pbi+i denotes the blocking probability due to full 
buffer state at Vi+i. This assumption allows us to track 
the blocking probability ignoring higher order statistics 
of the blocking process. 
A3. For each node Vi and epoch I, the event of packet arrival 
and the event of blocking from are independent of 
each other. 
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Fig. 4. Illustration of the assumptions in iterative estimation. 
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Fig. 5. The Markov chain for rii under the simplifying assumptions. 



The above assumptions are valid in the limiting case of large 
buffers provided the system corresponds to a stable queueing 
configuration. By assuming that they hold in general, the 
effect of blocking is spread equally over all non-zero states 
of occupancy at each node. Similarly, the assumptions also 
spread the arrival rate equally among all occupancy states. 
Given that the arrival rate of packets at the node Vi is 
packets/epoch, and the blocking probability of the next node 
is Pbi+i, the local dynamics of the state change for the node 
Vi under assumptions A1-A3 is given by the Markov chain of 
Fig. |5] with the parameters set to the following. 



a 



P = (1 - r.i)pbi^^et+i 



(12) 



"0 = Ti 

Using these parameters, the steady-state distributiorQ 

{(i)vi{k) = ¥'(fcki,ej+i,Pbi+i)}™io °f *s '^hain of Fig. |5] 
can be computed to be 

Lp{k\r^,ei+i,pbi+i) ={ °o°^'' • (13) 

Assuming that Vi observes a packet arrival rate of from Vj^i 
and a blocking probability of from Vi+i, the blocking 

probability pi^ that the node Vi^i perceives from the node Vi 
and the arrival rate r^+i that Vi+i observes can be computed 
via (fTSI l using the following equations. 

Pbt = + 'Si+lPb^+l)^imi\ri,ei+l,pbi+l) (14) 
Ti+i ^ei+i{l ~ (p{0\ri,ei+i,pbi+i)) (15) 

Note that the blocking probability pb^ is computed using the 
full occupancy probability of the node Vi. While in reality, 
a packet is blocked by Vi only if at the arriving instant, 

"if Pbi+i = 1. then we set ifi{nii\ri, et+i, pti^i) = 1. 



the node has full occupancy, A2 models any arriving packet 
to be blocked with the above probability irrespective of the 
occupancy of Vi. Also, in ( fT4] i and ( fTsT i the arrival rate from 
the node vi is ri = ei and the blocking probability pb/i = 0. 

Given two vectors r ~ (ri, . . . , r}^) E [0, l]'' and pb = 
{pbi, . . . ,Pbh) £ [0, 1]'\ we term (r, pb) as a rate-approximate 
solution to EMC, if they satisfy the equations ( fT4l l and ( fTSb 
in addition to having ri — ei and pb^ = 0. Since these 
relations were obtained from making assumptions on the 
EMC, it is a priori unclear if there exist rate-approximate 
solutions for a given system {£,Ai). Fortunately, the following 
result guarantees both the uniqueness and an algorithm for 
identifying the rate-approximate solution to the EMC. 

Theorem 3: Given a line network with link erasures £ = 
(£i,...,e/i) and intermediate node buffer sizes A4 — 
(mi, . . . , m/i_i), there is exactly one rate-approximate solu- 
tion {r*{£,M),Pb*i£,M)) to the EMC. Further, the rate- 
approximate solution satisfies flow conservation. That is. 



'il-pb:) = rUl-pb*), 



1 < i,j < h. 



Proof: The proof is detailed in Appendix |E] ■ 
Finally, the estimate of the capacity can be obtained from the 
rate-approximate solution by computing the average rate of 
packet storage at each node using 



C*i£,M)=r*{l-Pb:), i = l,...,h. 



(16) 
, h} can 



Note that by the conservation of flow, any i e {1, , 
be used in the above equation to identify capacity. 

As an illustration, consider a simple four-hop network with 
erasures £ = (0.5, 0.4999, 0.4998, 0.4) and buffer sizes M = 
(5, 5, 5). From the above estimation method, we arrive at 

r* = (0.5,0.46797,0.43958,0.43484), (17) 
Pb* = (0.13031,0.07078,0.01076,0), (18) 
C*{£,M) = 0.43484 packets/epoch. (19) 

From simulations, the throughput capacity was found to be 
0.43501 packets/epoch for the same network. 

2) Distribution-based Iterative Estimate: In this section, we 
assume that the given line network (£, A^) satisfies Si ^ ej 
for i 7^ j. Since the capacity of a line network is a continuous 
function of the system parameters, this assumption is not 
restrictive. A system with non-distinct erasure parameters can 
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be approximated to any degree of precision by a system with 
distinct erasure probabilities. 

Before we introduce the second approach for estimation, we 
present the following technical resuljfl wherein we denote I to 
be the identity distribution for the convolution operator. 

Theorem 4: Consider a tandem queueing system of two 
nodes where the first node possessing m buffer slots is fed 
by a renewal process whose inter-arrival time distribution is 
g'" = YIiS^ Vi'^{Q with pi,...,pAr_i 6 R and ^ 
for l<i<j<A^ — 1. Suppose that the distribution of 
service time is (G(CAf), where Cjv Ci for 1 < i < — 1. 
Further, suppose that the second node blocks an arriving packet 
memorylessly with probability g G (0, 1), and that any blocked 
packet gets re-serviced. Then, the distribution of inter-arrival 
times as seen by the second node is given by 

g™' = T(.g'",m,Cjv,g)®G'(Civ), (20) 

where T(g'", to, Cw, g) = ("I + " Z]i!=T^ Pi^^(C/)) for some 
< a < 1 and p| e R, / = 1, . . . , - 1, with Y^i P'l = 1- 

Proof: A detailed analysis including the means of iden- 
tifying a, {p^ : i = 1, . . . , N} is given in Appendix |F] ■ 
Just as in the Rate-based Iterative Estimate, this estimate 
also makes three assumptions to simplify the EMC. While the 
Distribution-based Iterative Estimate makes assumptions A2 
and A3, it relaxes assumption Al to the following; 
Al*. The packet departure process at each intermediate node 

is renewal. 

Note that Assumption Al allows for tracking only the average 
rate of information flow on edges whereas Al* allows tracking 
of the distribution of packet inter-arrival times. However, Al* 
ignores the fact that the distribution of an inter-arrival time 
changes with the knowledge of past inter-arrival times. To 
track the inter-arrival distribution and blocking probabilities 
at each node, the Distribution-based Iterative Estimate uses 
Theorem |4] in a hop-by-hop fashion. Assuming that the packet 
arrival process at Vi is renewal with an inter-arrival distribution 
fi, and that the memoryless blocking from w^+i occurs with 
probability Pbi+i, we see that the packet inter-arrival distribu- 
tion seen by Vi^i is given by 

fi+i ^ T{fi,mi,ei+i,pbi+i) (E) (G{ei+i). (21) 

Notice that just like in ( fT2] l. T uses the effective erasure 
probability to incorporate the effect of blocking by Vj+i. 
However, this corrective term does not appear in G(-) term, 
because fi+i represents the distribution of packet inter-arrival 
times at w^+i, and not the distribution of the time between 
two adjacent successful packet storages at Vi+i. Further, the 
blocking probability of Vi as perceived by Vi^i is given by 

Pbi = Pr[A packet arriving at Vi sees full buffer] (22) 

^ 'P{f^,m,,e^+l,pb.^_^^l). (23) 

Just as in the Rate-based Iterative Estimate, we call a solution 
to (l2Tl i and (|23] | with boundary conditions pbf^ — and 

'For this theorem, note that we do not require that all piS or all p^s be 
positive. We only need that their sum be unity and that they generate a valid 
probability distribution, respectively. 



/i = (G(ei) as a distribution-approximate solution. Though 
the existence and uniqueness of the distribution-approximate 
solution for a given system {£, A4) has eluded us, simulations 
reveal that for each system, the solution is unique and can be 
found by iteratively using the following algorithm. 

Algorithm 1 Distribution-based Iterative Estimate 

1: Count = 1 and pftJCount] — 0, i — 1, . . . , h — 1. 

2: while Count <Max_Iter do 

3: /i [Count] = (G(ei), pt^ [Count] = 0, and j = 1. 
4: while j < h do 

5: Compute /j+i [Count], [Count + 1] employing 

i2J[ and {23} (that use /j [Count], Pb j_^_i[Count]) 
6: j + 1- 

7: end while 
8: Count ^ Count + 1. 

9: end while 



Note that during any round of Count in the above algo- 
rithm, (ISTT l can be iteratively used to identify [Count] in 
Step 5 only if the output distribution of inter-departure times 
from each node is a weighted sum of geometric distributions. 
This is however guaranteed if the erasure probabilities of no 
two links are equal. Alternately, Step 2 can be replaced by a 
convergence-type criterion instead of the Max_Iter criterion. 
After sufficiently large number of iterations, the distributions 
and blocking probabilities usually converge (to /* and Pb*), 
and upon convergence the capacity can be estimated via 

C*i£,M)^-^ (24) 

\Jh/ 

Using the above approach for the four-hop example network 
at the end of Sec. IIII-Cll we have 

/4 = 138240.92G(0.5) - 275765.59G(0.4999) 

+ 137525.64(G(0.4998) + 0.03(G(0.4), (25) 
Pb* = (0.12983, 0.070006, 0.010406), (26) 
M) = 0.435089 packets/epoch. (27) 

D. Capacity of Line Networks without Feedback 

Feedback from next-hop node provides a natural means of 
buffer update and packet deletion. In the absence of feedback, 
due to the finiteness of buffers, each intermediate node must 
have a local rule for buffer update to accept packets that arrive. 
A rule for packet deletion or update must be maintained at 
each node so that the buffers are used efficiently. A network 
coded-scheme based on random linear combinations over a 
large finite field Fq of size q as is described in [21 1 presents an 
effective means of buffer update and packet delivery. Consider 
the following scheme based on network coding. 

1 . At each epoch, a node having a buffer size of to packets 
picks a vector a e F™ uniformly at random to generate 
a random linear combination in the following manner. 
For each buffer slot i G {1, . . . , m}, the packet Pi stored 
in that slot is represented as a vector over Fq and the 
output packet is generated by computing X^i" i o.iPi- This 
generated packet is then transmitted during the epoch. 

2. If a packet P is received by a node at an epoch, it 
first generates the output packet at that instant and then 
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updates its buffer in the following manner. The node 
selects a vector b S F™ uniformly at random and for 
each k G {l,...,m}, adds the packet bkP to the packet 
stored in the fc* buffer slot. 
Note that in the network coding scheme described above, 
after sufficient time after the commencement of packet transfer 
from the source, all buffer slots of every intermediate node 
almost always have non-trivial contents unlike the scheme 
with perfect feedback. However, it is not true that all of 
these packets are innovative, i.e., packets may contain common 
informatio^ Such a condition may occur when the packets 
are linearly dependent in the algebraic sense. With this notion 
of information, the rate of information received by the desti- 
nation node can be seen to be the asymptotic rate of arrival 
of innovative packets. The following result characterizes these 
rates achieved by the network coding scheme over the field 
Fg and relates it to throughput capacity in the presence of 
lossless feedback C^^(f 

Theorem 5: Let C^^{£,M.) denote the rate of arrival of 
innovative packets at the destination node of a line network 
without feedback assuming that the aforementioned network 
coding scheme over the field Fg is employed. Then, for each 
sequence of finite fields {F^, j/gN such that qi oo, we have 

^NFtc - cPF{£^M). (28) 



lim Cg.;(£,M) 



Proof: The proof is presented in Appendix |G] ■ 
From (|28] |. we observe that there is no loss in achievable 
rates when feedback is absent and that the aforementioned 
network coding scheme is rate-optimal for line networks 
without feedback, provided a large field size is employed. 

IV. Packet Delay Distribution 

In this section, we use the iterative estimates of Section lTlI-CI 
to obtain estimates on the probability distribution of the delay 
experienced by information packets in line networks with 
perfect feedback under the optimal strategy of Section IIII-AI 
We abstain from defining latency of data packets in networks 
without feedback, since optimal schemes for such networks 
involve packet-level coding. 

When perfect feedback is available, we define the delay 
of a packet as the time taken from the instant when the 
packet is stored in the buffer of the first intermediate node 
to the instant when the destination receives it. Since the 
delay statistics depend on how the packets are handled in 
intermediate nodes, in addition to the optimal scheme of 
Section IIII-AI we assume a first-come first-serve treatment of 
packets at intermediate node buffers. Note that this assumption 
is made only for the ease of presentation. The framework 
permits the analysis of randomized schemes where each node 
after a successful transmission selects a packet in its buffer 
randomly and memorylessly, and transmits it repeatedly until 
it is stored at the next-hop node. 

In order to compute the distribution of delay that a packet 
experiences in the network, one can proceed in a hop-by-hop 

^Here, we use information to represent tlie number of linearly independent 
packets w.r.t. the chosen base field. A set S = {Pi, ... , Pjv} is said to 
contain n packets of information if dim(span(5)) = n. 



fashion using two parameters: (1) pj, an estimate of blocking 
probability at node Vi, i — 1, . . . ,h, and (2) i)i{k), an estimate 
of the distribution of occupancy at node Vi (for each i = 
1, . . . , /i — 1 and k = 0, . . . , rrii) just before packet arrival 
conditioned on the event that the arriving packet is successfully 
stored. 

In the last relay node Vh-i, the additional delay perceived 
by a packet arriving at epoch depends on the occupancy 
nh-i{l) of the node Vh-i and Eh. Suppose at epoch I, node 
I'h-i has k < nifi-i — 1 packets excluding the arriving packet. 
Then, the packet has to wait for the k already-stored packets 
to leave before it can be serviced. Since the services are 
memoryless, the distribution of delay is given by a sum of fc+1 
independent geometric distributions each with a mean inter- 
arrival time 



-, I.e., 



(G(e/i). Hence, the distribution of 
additional delay induced by waiting in the buffer of Vh-i is 



E 

i=0 



V'„_i(z)[®'+iG(e„)]. 



(29) 



However, the situation is different for other intermediate delays 
because of the effect of blocking. The additional delay incurred 
while being stored in the node vj , < j < h — 1, is given by 



i=0 



(30) 



since a packet is deleted from the buffer of Vj only if the 
channel successfully transmits it and Uj+i does not block the 
arriving packet, which by assumption A2 occurs memorylessly 
with a probability Sj^^ = Ej+i +pj+i£j+i. Assuming that the 
delays incurred by waiting in the buffer of each node is inde- 
pendent of each other, we obtain the total delay considering 
all hops to be 



h-l- 



(31) 



Note that in addition to the above delay, the source node at- 
tempts to transmit the packet multiple times before the packet 
is successfully accepted at the first intermediate node. The 
distribution of this time spent in this is approximately given 
by a random variable whose distribution is ©(e'l). Thus, the 
iterative estimation technique provides us with a framework 
to approximately but analytically compute the delay profile 
using an estimate of distribution of packets seen by an arriving 
packet that is successfully stored, and an estimate of the 
blocking probabilities. 

Finally, the pair of estimates (ipji'), Pj) can be obtained 
from the rate-approximate solution by using 



i = 



1 < z < rrii 



(32) 
(33) 



Similarly, another pair of estimates can be obtained using 
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the Distribution-based Iterative Estimate by 



Pj = Phi , 



i-Pbj 

7r* (m)-pb* 



0,. 



i — m-i — 1 



(34) 
(35) 



where 7r*( ) is the eigenvector of the f5% upon convergence. 
Combining the above equations, two estimates for the delay 
profile for line networks with feedback can be obtained. 

V. Results of Simulation 

In this section we present the results of simulation com- 
paring our analytic results to simulations of line networks 
with perfect feedback. First, the simulations for the capacity 
are presented, and then the simulations for delay profiles are 
presented. This section ends with a discussion on the efficient 
usage of buffers and the interplay of buffer size, capacity and 
delay. 

In our model, a line network is completely defined by 
the number of hops, the erasure probability for each link 
and the buffer size at each intermediate node. To study the 
accuracy of our bounds and estimates, we vary one of these 
three parameters while keeping the remaining two fixed. In 
each of the figures, the actual capacity and bounds obtained 
via simulations are presented in addition to our estimates. 
Further, for the sake of brevity, we abbreviate Distribution- 
based Iterative Estimate (Algorithm [Til), Rate-based Iterative 
Estimates (Algorithm |2]l. Lower Bound (Thm. [U, and Upper 
Bound (Thm. ^ to DblE, RblE, LB, and UB, respectively. 

Figure |6] presents the variation of the capacity with the 
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Fig. 6. Capacity of line networks with m = 5 and varying hop-length h. 

number of hops of line networks when each intermediate node 
possesses a buffer size of five packets. The figure presents 
simulations for networks when the probability of erasure on 
each link is set to either 0.25 or 0.5. First, it is noticed that the 
bounds and iterative estimates agree with the actual capacity 
for two-hop networks. Second, it is noticed that the bounds 
and estimates capture the variation of the actual capacity of 
the network. However, the estimates are more accurate. For 
both choices of channel parameters, both estimates predict 
throughput capacity within an error of 1%. Further, it is also 



noticed from the figures that the independence assumptions of 
the estimates generally over-estimate the actual capacity of the 
network. 

In order to study the effect of buffer size on capacity, 
we simulated a five-hop line network with each link having 
erasure probabilities just as in the previous setting. Figure |7] 
presents the variation of our results and the actual capacity as 
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Fig. 7. Capacity of line networks with h = 5 with varying buffer size m. 

the buffer size of the intermediate node is varied. It can be 
seen that as the buffer size is increased, all curves approach 
the ideal min-cut capacity of 1 — e. Also, as is expected, the 
accuracy of the bounds improve with the buffer size. 

Finally, the effect of the channel conditions on the capacity 
of a five-hop line network with intermediate buffer sizes of 
five packets each is presented in Figure [8] It is noticed that as 
the probability of erasure increases, the loss in capacity due to 



-T-UB 

■^LB 

— Capacity 
--RblE 
+ DblE 

• Min-cut Capacity 




0.15 0.2 0.25 0.3 0.35 0.4 0.45 
Probability of erasure e 

Fig. 8. Capacity of line networks with h = 5, m = 5 with varying erasure 
probability e. 



finite buffer becomes more pronounced. For example, for the 
simulation setting of Fig. [8] the loss in capacity varies from 
3.85% at e = 0.1 to 16.1% at e = 0.5 in a near-linear fashion. 
From these figures, we infer that it is paramount that the effect 
of blocking be considered as reaUstically as possible. Modeling 
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the effect of blocking as packet loss (as is done to derive our 
bounds) only allows us to loosely bound the capacity of such 
networks. 
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Fig. 9. FCFS delay profiles in 8-liop line networks with different buffers. 

Figure |9] presents the variation of delay profile for the 
optimal strategy in an eight-hop line network with the erasure 
probability on every link set to 0.25. The delay profiles were 
simulated for three different buffer sizes. As in Section HV] 
the estimate and simulations were performed for the first- 
come first-serve strategy. From the first sub-plot, it is noticed 
that both mean and variance of the delay distribution increase 
as buffer sizes increase. While the mean delay obtained via 
simulations for the three memory settings are 30.22, 55.18, and 
81.29 epochs, whereas the analytical result for the same using 
the Dist.-based Iterative Estimate are 30.09, 55.22, and 81.68 
epochs, respectively. Note that the analytical estimates for the 
mean delay ^*{£,A4) can be obtained without computing the 
delay profile by the use of Little's theorem [22] as follows. 



h-l 



/i-i 



^C(f,X) ^e;.(l-<_^(0))' 



(36) 



where, as before, 0* (•) denotes the distribution of occupancy 
of Vi at steady state given by the Rate-based Estimate. Note 
that each term in the above sum can be viewed as the 
contribution of the corresponding node to overall delay. It 
is noted that the analytic prediction of the delay profile is 
more conservative than the actual delay profile in the sense 
that the estimate of the variance is higher than the actual 
variance of packet delay. The second sub-plot of the figure 
illustrates the difference in the cumulative distribution of delay 
predicted by the two estimates. It is noticed from all the above 
simulations that there is only a minor difference between the 
two estimation schemes if the parameters of interest are either 
the throughput capacity or the delay profile. 

Figure [TO] highlights the difference between the two es- 
timates when continuous-time models are emulated using 
discrete-time epochs. Consider a three-hop line network where 
intermediate nodes have a buffer of three packets and their 
packet service distributions are exponential with (A2,A3) = 
(3, 2.99) s^^. Suppose that the arrival process at the first node 



is renewal with inter-arrival distribution being exponential with 
Ai ~ 10 s~^. The following figure presents the distribution of 
inter-departure duration from the second node. It is observed 
that by lumping A = 0.001 seconds into each epoch, the 
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Fig. 10. The probability density of packet inter-arrival duration at the 
destination in a three-hop continuous-time line network. 

Distribution-based Iterative Estimate provides a near-accurate 
distribution of the inter-departure durations. On the other hand, 
the Rate-based Iterative Estimate approximates the distribution 
as an exponential, which yields a less accurate estimate. Note 
that for this setting Ca{M) = 2.2467 packets/sec, and the 
Distribution-based and Rate-based Estimates are 2.2447 and 
2.2413 packets/sec, respectively. 

A. Buffer Allocation in Line Networks 

In this section, we present a brief discussion on two ques- 
tions pertaining to efficient usage of buffers in intermediate 
nodes. Is the use of more buffer slots, the merrier? and How 
to allocate buffers to different nodes so that operation ensures 
near-min-cut throughput and acceptable delay? 

To address the first question, consider the eight-hop network 
of Fig. |9] As the buffer size is varied from 10 to 15 packets, 
the Rate-based Estimate for capacity changes from 0.7135 to 
0.7254 packets/epoch - a change of less than 1.5% (of the min- 
cut bound). However, the mean latency changes from 55.18 to 
8 1 .29 epochs - a 47% change. Therefore, for each 8, it is likely 
that there is a critical buffer size for each node beyond which 
the throughput capacity improvement is marginal; however, 
with increase in buffer sizes, the average time packets spend 
in the network continues to grows significantly. One must 
therefore identify the correct size of buffers to be used so 
that both latency and throughput capacity are acceptable. 

To discuss the second issue, we illustrate with the fol- 
lowing example. Consider a four-hop network with £ — 
[0.3 0.5 0.5 0.2] for which a good choice of buffer allocation 
needs to be identified under the constraint that the total number 
of buffers in the network must be no more than 30 packets. To 
this end, we use the Rate-based Estimate to study the effect 
of individual buffer sizes on throughput and delay. Fig. [TT| 
shows the variation of the throughput and delay contributed 
by each node when its memory is varied from 1 to 20 packets. 
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Fig. 11. Throughput, and average delay contribution by each intermediate 
node with varying buffer size. 



while the buffer sizes of other intermediate nodes are kept 
at 20 packets. In this example, it is noticed that maximum 
throughput estimate for all choices of memory estimates is 
0.4871 packets/epoch when Ma = (5, 21,4). This setting of- 
fers a mean packet delay of 32.24 epochs. However, minimum 
delay configuration amongst those that offer a throughput more 
than 0.485 packets/epoch is Aib = (4,20,6), which offers a 
throughput of 0.4851 packets/epoch and a mean packet latency 
of 28.46 epochs. The actual capacity and delay for these 
configurations were found to be C{£,Ma) = 0.4871 pack- 
ets/epoch, n{£, Ma) = 32.17 epochs and C{£, Mb) = 0.4858 
packets/epoch, fi{£,Mb) = 28.33 epochs, respectively. 

To understand further these patterns, we present in Fig. [12] 
the steady-state occupancy of the three intermediate nodes 
when buffer sizes are set to Ma — (5,21,4) packets, Mb — 
(4, 20, 6) packets and Mc — (15, 15, 15) packets, respectively. 
In all settings, it is noted that the node vi is congested because 
the sub-network from vi to 774 has a min-cut capacity of 0.5, 
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Fig. 12. Estimated Buffer occupancy distribution in intermediate nodes. 

whereas it receives packets at the rate of 0.7. Therefore, the 
steady-state occupancy of the node vi for Ma and Mc are 
translates of that of A^f,. Due to congestion, an arriving packet 
at such a node usually sees very high occupancy. Hence, 



in a first-come first-serve mode of operation, the arriving 
packet has to wait long before getting serviced. Therefore, 
it is critical that the buffer size of congested nodes (such as 
Vl) be kept to absolute minimum to minimize average packet 
delay. Similarly, U3 can at most receive packets at a rate of 
0.5, however the outgoing link can communicate packets at 
a much higher rate. Therefore, the buffer of U3 is never full 
as long as the buffer size is greater than five. Nodes such 
as V3 that are never congested contribute little to the delay 
experienced by packets. Hence, limiting buffer sizes of such 
nodes is not critical for delay as long as the sizes are bigger 
than their threshold sizes (beyond which throughput increase 
is marginal). 

Occupancy in nodes like V2 that are neither congested nor 
starved undergo non-trivial changes with changes in buffer 
sizes. These nodes contribute significantly to both the through- 
put and average packet delay in the network. For example, in 
the example network V2 has a near-uniform distribution for 
both Ma and Mb- Just like congested nodes, such nodes 
have to be allocated buffer sizes so that the they neither block 
packets nor contribute to delay significantly. Though the clas- 
sification of nodes as congested, starved or neither can usually 
be done by focusing on £, good memory allocation requires 
knowledge of trends of latency and throughput with buffer 
sizes, which in turn require the help of more sophisticated 
estimates such as those proposed in this work. 

As a second example, consider another four-hop network 
with £c = [0.51 0.50 0.49 0.48]. In the infinite buffer setting, 
the queueing system corresponding to this buffer configura- 
tion is stable. Hence, no node can be classified a priori as 
congested. Suppose that a throughput-optimal allocation of 
buffer sizes for intermediate nodes is to be designed with the 
constraint that the total number of packets in the network 
be limited to 60. Clearly, a naive first guess is to assign 
Md — [20 20 20]. However, notice that no matter how large 
the buffer sizes are, the probability of blocking at any node is 
always non-zero. Hence, the rate of arrival that V2 and see 
is smaller than that noticed by vi . Therefore, it is meaningful 
to assign vi a larger buffer size to minimize blocking at vi 
and maximize throughput. Although this intuition is correct, 
it is unclear as to how to allocate buffers. The strength of 
the iterative technique is in resolving exactly this issue by 
assigning estimates to each buffer allocation configuration. 
By searching around the neighborhood of Md, the maximum 
throughput configuration is found to be Me ~ [27 20 13]. 

As is illustrated by these examples, the proposed iterative 
estimation techniques presents a framework to identify nodes 
in line networks that are either: (a) starved and therefore play 
an insignificant role in capacity and packet delay (such as 
W3 of Fig. [T2] l. or (b) congested and contribute significantly 
to packet delay (such as vi of Fig. [T2] |. or (c) contribute 
significantly to both capacity and packet delay (such as V2 
of Fig. fT2l i. On identifying these nodes, it is possible to 
identify configurations that make efficient use of the buffers 
without severely compromising on either throughput capacity 
or average packet delay. 
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VI. Conclusions 

This work focused on the effect of finite buffers on the 
throughput capacity and packet delay profile in line networks 
with packet erasure links. First, an exact Markovian frame- 
work for modeling line networks with perfect feedback was 
presented. The framework was simplified using independence 
assumptions to derive iterative estimation techniques that yield 
approximations of all marginal buffer statistics and also allow 
to identify the packet delay profile in such networks. Further, 
it was shown that the absence of feedback has no effect on 
the throughput capacity of line networks provided packet-level 
coding is permitted. Finally, via simulations, the proposed iter- 
ative techniques were noticed to be computationally-efficient 
and near-accurate models to analyze and study the behavior 
of line networks. 

Appendix A 
Discrete and Continuous models 

In this section, we argue that the discrete model assumed 
in the paper can be used to study the capacity of tandem 
queue model with type II blocking (see f9l) and independent 
exponential service times at each node. Consider a tandem 
queue of h links and h — 1 intermediate nodes. Suppose 
A = (Ai, . . . , Xfi) denotes the parameters for the exponential 
service times at wp, . . . , Vh-i, respectively. Assuming that each 
intermediate node has buffers given by = {mi, . . . , nih^i) 
and that Nt denotes the number of packets collected by Vh 
in the period [0,t), the throughput capacity CA(A^jZl of the 
system is defined by 
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can be computed from a discrete model assumed in this paper. 
For example. Fig. [T3] considers a four-hop system with each 
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Fig. 13. Throughput of continuous and discrete systems for varying buffer 
sizes 

node having exponential processing times with parameter A = 
2 s~^ and compares it with four discretized models. Note that 
the approximations become finer as smaller values of r are 
chosen. This fact can be formalized as follows. 

'Note that the definition of throughput using (37) hinges on the ergodicity 
of the continuous-time system. 



Ca(M) = lim T-'C^" (1 - At,M), (38) 

where the right-hand side uses the discrete-time model of (|4|. 

Proof: We begin by constructing the probability transition 
matrices for continuous and discrete chains that track the state 
of the system just before a departure from the last intermediate 
node. Note that both chains use the same state space S ~ 
{{si, . . . , Sh-i) ■ < Si < nii}, however their transition 
probabilities are different. 

Let n denote the probability transition matrix for the 
continuous model. Let AI denote the transition matrix that 
effects the change in states when a departure from the last 
intermediate node occurs and let Pt denote the transition 
matrix corresponding to changes in state over a duration of t 
seconds given that no departure occurs in that duration. Then, 
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where denotes the cumulative density function of the 
exponential RV with parameter Xh- Notice that for any r > 0, 

Pt = (P^L^J-Pt-rLiJ- Therefore, 

Pt = lim (P,) LIJ , . , lim (P.) L*J , (40) 

where (a) follows since Pt_.rL-J ~^ ^- Let P^ denote the state 
transition matrix for one time epoch of the discretized model 
with £ = 1 — At. Then, we have 



P, = P^ + o(t2) ^Pt= lim (P^ + o(r2) 
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Therefore, we have 



n = M / PtdF^Jt) = M I Hm (P,^) '^dF^Jt) 
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where (a) follows from Fubini-Tonelli Theorem [23 1. The 
discrete equivalent 11^ of the above transition matrix that 
tracks the state between departures for the corresponding time- 
discretized system is given by 



= XhTM + {XhT)XhTMP^ + [XhTfXHTMPf 
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Let ^(s, s') = lim^^o (n(s, s') - (s, s')) for any pair of 
states s, s'. Then, 
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Note that (b) follows since MP^^ is a probability matrix and 
hence each component is bounded above by unity, and (c) 
follows since for A/iT < 1, it is true that 



|l-e^''n-A,r = ^ 



< e{XhTf, 



i>2 



and (d) follows from > 1 — x for a; > 0. Thus — > 11 
as T — ?• 0. Let and v be the eigenvectors of 11^ and 11, 
respectively. Then, since the steady-state distribution of a chain 
is a continuous function of the transition matrix, it follows that 
i/^ — >■ z/, as T — s> 0. However, the capacity computed using 
continuous and discrete models are given by 

C^{M)^Xn ^(^) 

Therefore, r-iC^^(l - At, M) Ck{M). ■ 

Appendix B 
Proof of Lemma[T] 

(a) Suppose that the state of the system is n = 
(ri,i(/), . . . , with < nh-i{l) < rnu-i, then from 
([Hi, we notice that Yh = Xh and Yh-i ~ a[nh-2{l)]Xh-i. 
Hence, given the event nih-i > nfi-i{l) > 0, Y(/) = 
{Yi{l), . . . ,Yh{l)) depends only on . . . , n,i_2(0) and 
{Xi{l), . . . ,Xh{l)) and not on nh^i{l). This guarantees that 
{T-,Tt,n,) = (T^,T+,nj) for Q<i,j < m„_i. 

(b) First suppose h > 2. Consider F" for some i > 
and the state of the system at some time / G N. 
F^ represents transitions from states that have the form 
(ni(Z), n2{l), . . . , nh-i{l) = i) to states of the form (ni(Z + 
l),n2il + l),...,nh-i{l + l) = i-l). Since Uh-iil + I) = 
nh-i{l) — 1, it must be that Yfi^i{l) — and that the channel 
must have erased the packet transmitted by Vfi-2- Denote 

— ni<A:<i('^i + 1) foi" i > 1 and Li — 1. Then, it is 
seen that for any realization of {Xi{l)}'l^Q, it is true that the 
state transition must obey 



1 + rii(/) + 



h~2 
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< 1 + ni{l + l) + Y "»(^ + 1)-^^- 



However, ^1 + ni{l) + X]i'=2^ "-«(0 n}=i(™i + 1)) is the 
index of the row corresponding to the state n(/) within F^ and 
{l+n,{l + l) + J2'^Z^ ni{l + l) 11^=1 + is the index of 
the column corresponding to n(/ + l) within F^^. Therefore, all 
possible transitions in F^ correspond to transitions from states 
to other state that involve a non-positive change in the row- 
index. Therefore, F^ is upper triangular Finally, since each 
diagonal term of F^ is bounded below by Ilfc^o ^fc+i' 
conclude that 



h-2 



det(F,-)> (e/, n^fc+i)^' ' > 
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(45) 
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Finally, if ft, = 2, it is easy to see that F~ = [£2ei]. 

(c) Consider a transition under F^ for i < rrih-i from 
a state that has the form {ni{l),n2{l), ■ ■ ■ jfih-iH) = i) to 
another that has the form {ni{l + l),n2{l + 1), . . . ,nfi^i{l + 
1) = i + l) after an epoch. Since nh-i{l + 1) = nh-i{l) + 1, 
it must be that the packet transmitted during this epoch on 
the link (vft,-2, must have reached successfully, i.e., 
^/i-2(0 = 1- an argument similar to the above one, 
we can show that F^ is lower triangular. However, certain 
diagonal terms are zero. In specific, consider the transition 
from state {ni{l) = 0, . . . , n/i_2(0 — 0,n;i_i(/) = i) to 
the state {ni{l) = . . . ,nh-2{l) = 0,nh-i{l) = i + 1) 
which corresponds to the (F^)ii. However, this transition is 
impossible when h > 2, since the node w/i_2 has no packets 
to send during this epoch. Thus, det(F^) = if ft. > 3. 

(d) The non-singularity of / — fti follows from the fact 
that (/ — rii) is diagonal dominant [24], since (/ — il.i)kk > 
12k'^k ~ ^i)kk'\- On the other hand, since F^,F^ ^ 0, 
there exists at least one k for which the inequality is strict, 
which guarantees the non-singularity of these matrices. 

Appendix C 
Proof of Theorem[T] 

We proceed by mathematical induction on the time index /. 
Clearly, the condition holds for I = 0. Suppose that the claim 
is true for all nodes and for times I = 0, . . . , k for some fc > 0. 
Consider the states of the node Vi for some i = 2,...,ft, — lin 
both chains at time instant k. One of the two following cases 
must apply. 

1. ni{k) = hi{k): In this case, we note that 

n,{k+l)-fi,ik+l) = Y,{k)-Y,{k)-Y,+i{k)+Y,+i{k). 
If ni{k) = n,(fc) = 0, then Y,+i{k) = Y^+i{k) = and 
Y,{k) - Y,{k) = X,(fc)[(7[n,_i(fc)] - fT[n,_i(fc)] > 0. 
Thus n,{k + 1) - n,{k + 1) > 0. 

Now, if ni{k) = ni{k) = rui, it is seen from (HJ and ^ 
that -r,+i(fc) > 0. Further, if Y,+i(l) = 0, then 

clearly, ni(fc + l) = rrii and ni{k + l) < rrii = ni{k + l). 
If Y,+i{k) = 1, then Y,+i{k) = 1 and fi,{k + 1) < 
ni{k + 1) follows since 

y,(fc) - f,(fc) = X,(fc)(aK_i(fc)] - > 0. 
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Now, if < ni{k) = ni{k) < mi, then ([TJ and ^ 
again imply Y,+i{k) -Y,+i{k) > and Y,{k)-Y,{k) = 
Xi{k)(^a[ni^i{k)]—(T[ni^i{k)]) > 0, and hence ^^(fc) > 
hi{k) follows. 
2. ni{k) > ni{k) + 1: Assume let hi{k) > 0. Then, 



n,{k + l) < ni{k) + l- Yi+Ak) 

= fi,[k) + 1 - a[ni{k)]Xi{k) 

< fi,{k) + 1 - r,+i(fc) < n,{k) - 

< n,ik) + Y,ik) - r.+i(fc) = n,(fc + 1). 

Lastly, if ni{k) = 0, then the claim can be violated only if 
hi{k + l) = 1 and ni{k + l) — 0, which can happen only 
if Xi{k) = Xi+i(fc) = 1. However, under this channel 
instance, ni(A:+l) > ni{k) > 1. Thus, ni(A:i) > h{k+l). 

Thus, we have the following. 



ni{k) > ni{k), 



2,.../i- 1. 



(46) 



The proof is then complete by following the above argument 
for vi and interpreting (T[no(/c)] — <j[hQ{k)] = 1, since the 
source always possesses innovative packets. 

Appendix D 
Proof of Theorem[2] 

If h = 2 comparing ([B and we see that the AMC 
and the EMC are identical. Hence, we may assume h > 2. 
The proof in this case is based on mathematical induction 
on the time index I. At each time, we compare the state of 
the EMC with that of the modified AMC. Let the extended 
state of the EMC at an instant I E Z>o be denoted by 
n'^(/) ~ {ni{l), . . . ,nii{l)), where the notation is identical to 
that of Sec. Uni with the addition that denotes the number 
of packets that the destination has received by the epoch. 
Similarly define the extended state of the AMC with modified 
buffer sizes at an instant / G Z>o by q^{l)- Define a partial 
ordering of vectors of Z>q in the following manner. For two 

vectors v, v' 6 Z>q, v ^ v' if J2k=i — Y^k=i "fc ^^'-^ 
i — 1, . . . , h. We track the system starting from initial rest (all 
buffers being empty) using an instance of channel realizations. 
Clearly q'=(0) h n^(0). 

Suppose that q^{l) h for I — 0, ... k — 1. Consider 

I = k. One of the following two situations may aris^l 
1. {i < h : qf{k — 1) = "^i} = 0" this case, no 

node is saturated in the AMC and hence every node 
can potentially accept packets provided both the node 
preceding it has packets to send and the channel allows 
it. Consider the number of packets that are in the buffers 



of nodes ti, 



, Vh for some Q < j < h'm both chains. 



(i) If 7T,j_i(fc - 1) = or if both nj_i(fc - 1) > and 
Xjik — \) = {) are true, then 

h h h h 



^For convenience, we set qo{k) = no{k) = oo, fc > in tliis proof. 



(ii) If nj_i(fc-l) > 0, Xj{k-1) = 1 andgj_i(fc-l) 
then 



J2 <ll{k~l)> nl{k-l) 

h h 

=^ Qtik - 1) > E "^(^ - 1) + - 1) 

s=j s=j 
h 

>Yntik-l) + l. 

s=j 

Therefore, 

h h h h 

Yqtik) = Egl(fc-l) > E<(fc-1)+1 > Yntik)- 

s=j s=j s=j s=j 

(iii) Finally, if ?ij_i(fc - 1) > 0, Xj{k - 1) = 1 and 
Qj-iik - 1) > then 

h h 

h h 
s=j s=j 

Since j was arbitrary, it follows that q'^(fc) ^ n'^{k). 
2. {i<h:qf{k-l) = jy^^^Tn^}y^^: Then, let / = 

max{i < h : qf{k — 1) = "^i}- I" this case, nodes 

vj-^i, . . . ,Vh are not saturated and can accept packets. 
The argument for X]s=j 9s (^) ^ '^'s=j ^ti^) follows for 
j = I + 1, . . . , h is similar to the previous case. Notice 
that since the occupancy of nodes u^, i > / are not full. 



s>I 



1<L<I 



Now, for j = I, two cases may occur 
(i) If j = / > 1, then by (gT]), 

S>I S>I 1<L<I 

>E'?s(fc-l) + + ! 

S>I 

>E<(fc-l) + l>E"^(^)- (48) 

S>I S>I 

(i) If j = / = 1, and Xi{k - 1) ^ then 

Yfsik) - E'?^'(fc- 1) >T.^tik - 1) - E^sW- 

S>1 S>1 S>1 S>1 

However, if j / 1 and Xi(k — 1) = 1, then 
E Isik) > E itik - 1) + mi + X^ik) 

S>1 S>1 

> E <ik - 1) + nlik) + X2{k - 1) 

s>l 

>Y<^k)-Y2{k-\) + n]{k)+X2{k-l) 

(49) 



>E<(^) 
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Thus, the claim holds for j = I, I + 1, . . . , h. The claim 
is then complete if / = 1. Therefore, in what follows, we 
may assume / > 1. 

Finally, for 1 < j < /, one of the following cases must 
hold. 

(i) If Xi(fc - 1) = 0, then 

m 



J2qt{k)>Y.qt{k)> ^ <?,^(fc-l)+^™, 

s=j s — I s— 7+1 (-—1 

h I 

h h h 

s — 1 s — 1 s—j 

(ii) If - 1) = 1 then qf{k) > 1 and 



s>j 



S>I 



>^q:(fc-l)+^m, + a[2-j] 

S>I L=l 
I 



S>I s=j 

= ^<(fc-l) + l>^n:(fc). (50) 

s>j s>j 

Thus, the claim is true for all indices j = 1, . . . ,h and 
(f{k) h n^{k). Here, it must be noted that if the buffer 
sizes for the nodes of AMC are not modified as in the 
hypothesis, ( |47] | will not hold. 
Finally, the upper bound follows since 



lim > lim ^ =C^^(f,X). 
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Appendix E 
Proof of Theorem[3] 

Consider two rate-approximate solutions (r'^,pb°) and 
(r'',pb'') such that rJJ = rf^ = S with < 5 < 1. Notice 
that (p{0\rh-i,Sh,0) is a strictly decreasing function of rh^i 
when Efi is kept fixed. This follows from the fact that 

1 drh _ dip{0\ri,_i,eh,0) 



Eh drh-i 



drh- 



, l-r> 



1 



g \ 2^1=0 



< 0. 



(51) 



An easy way to understand this behavior is to notice that a, ao 
increase with r/,_i, while /3 decreases with r/i_i. Therefore, 
from (flSl l. it follows that 



= ri v'(0|r^i,£^+i,0) = ^(0|r,ti, 0) 



rh-i = r. 



(52) 



Now, from ( |52| | and (fl4l i guarantee Pb^^i ^ Pbh-i- We then 
use the monotonicity of (p{0\rh-2,£h-i,Pbh-i) ™ conjunction 
with already shown results to show that r^_2 = ^^'^ 



Pb'h-2 = Pbh-2- Extending this inductively, we have = r 
and pb" = Pb**- Therefore, for each 6 > 0, there is at most 
one solution satisfying rh ~ S. 

Now, consider two rate-approximate solutions (r"^, Pb"^) and 



(r*',pb'') such that < Sa 



5b < 1. By 



monotonicity of (^(0|r/i_i, e/j, 0), we have r^ -^ < rj^^i- From 
(O, we notice that is also a strictly increasing function 
in both its variables and Pbi+i- Therefore, Pb'fi^i < Pb\-i- 
Again, proceeding inductively from the last node to the first 
each time noticing the monotonic growth of (fl4l) and (fTSl) , 
we conclude that 



< 
< 



1, 



(53) 



However, since (r°,pb") and (r^,pb'') are both rate- 
approximate solutions, we have — r\ = ei, which 
contradicts ( |53T l. Therefore, there is at most one solution to 
the system of equations. 

To identify the unique solution, we construct a sequence 
of tuples {(r[^], Pb[^])};eN as described in Algorithm|2] Note 
that Step 2 of the algorithm can be replaced by a convergence- 
type step that halts if ||r[^] — r[/ — 1] ||i is smaller than a chosen 
threshold. 

Algorithm 2 Rate-based Iterative Estimate 



Count = 1 and [Count] = 0, i = 1, . . . , h — 1. 
while Count <Max_Iter do 

P6,JCount] = 0, r-i[Count] = 1 — Ei, and j — 1. 

while j < h do 

Compute rj+i [Count] 



,j [Count + 1] employing 
i fTSl l and G4J (that use [Count], Pb^+i [Count]) 

end while 

Count ^ Count + 1. 
end while 



By the monotonic property of the non-linear system of 
equations, the following results can be established. 



< Pb^[l + l] 



I e N. 



(54) 



However, each component of r and pb is individually bounded 
by unity. Therefore, the sequence of numbers for each compo- 
nent of these vectors must converge. Denote the component- 
wise limit as W* = (r*, pb*). Denote S : [0, 1]'' x [0, 1]'' — > 
[0,1]'' X [0,1]'* to be the following map. For each r, pb G 
[0, 1]'', denote S(r,pb) to be the pair, whose first component 
is the vector of rates computed from ( fTsT i and the second 
component is the vector of blocking probabilities computed 
from (fT4l) . Then, S is a continuous map and S((r[/], pb[Z])) = 
{r[l + l],ph[l + 1]) for each I £ N. Also, for this sequence of 
rates and blocking probabilities, we note that 



|S(W*)-W*||^<||(r[?],pb[/])-W*||o, 

+ ||S((r[Z],pb[;]))-(r[q,pb[ 
+ ||S(W*)-S((r[;],pb[;]))||, 



(55) 



However, the right-hand side of i55i is true for any / G N. 
By allowing / oo, the three limits vanish and hence we see 



XV 



that W* = (r*,pb*) is a fixed point of the map and hence 
the unique solution to the system of non-Hnear equations. 

Finally, to see the conservation of flow, notice that the Rate- 
based Iterative Estimate models the system using a discrete- 
time M/M/l/fc system by the introduction of additional as- 
sumptions and parameters. In the model, the number of 
innovative packets that are successfully stored by Vi as the 
system progresses from Z = to ^ = is given by 

Nr* (1 - Pr[n, = m,] + e.+iP^*+i Pr[n, = m,]) + o{N) 
= Nrl{\ ~ (^(m,|r*,£,+i,pfc*_^i)(l - + o{N) 



Nr*rb*+oiN). 



(56) 



Similarly, the number of packets successfully output by Vi is 
given by 

7Ve*+i(l - Pr[n, = Q])pE*+i + o{N) 
= iV£*+i(l - ^iO\r*,s,+i,pb*+i))pE*+i + o{N) 



(57) 



Since the M/M/l/fc system is lossless, all stored packets 
eventually leave the system. Thus, the average rate of packet 
storage at a node must match the average rate of packets output 
from that node. Comparing ( |56] | with ( fSTl ). the conservation 
of packet flow for the rate-approximate solution follows. 

Appendix F 
Proof of Theorem|4] 

The proof elaborates the behavior of a tandem system via a 
formal setup for the discrete-time equivalent of the G/M/l/k 
queue [25|. To illustrate the complications in the setup. Fig. [14] 
presents a section of an inter-arrival period for the first node. 
The number of customers in the queue of the node just 
before an arrival or a departure is presented on the axis. The 
arrival and departure of customers is marked by incoming and 
outgoing arrows, respectively. In Scenario A, we see that the 
queue is never starved and as a result all the inter-departure 
times are instances of the service process. 
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Fig. 14. A section of inter-arrival periods at the first server (assuming it 
possesses five customer slots). 

However, in Scenario B, we notice that all the five customers 
that are in the queue after the arrival are serviced much ahead 
of the next arrival and hence there is a period of time during 
which the queue is starved. If the queue were not starved. 



it could have possibly serviced a customer at the instance 
marked by the outgoing dotted arrow. Hence, this duration 
of time denoted by X in the figure, adds a delay to the inter- 
departure time. Thus, if we are able to extract the distribution 
{f^{i)}ie]N of this duration, we can identify the inter-arrival 
distribution 5°"' as seen by the second node to be a weighted 
sum of fx (E) <G{Cn) and ©(Cw). 

In order to identify the distribution fx, we need to identify 
the probability distribution tt of the number of customers 
in the first node's buffer just after an arrival. The first step 
in identifying tt from the imbedded Markov chain for the 
occupancy of the first node is to construct the distribution 
{Dj}je7L>Q of the number of packets that could be potentially 
transmitted during an inter-arrival duration Ta provided the 
queue were infinite. This distribution can be computed from 
the arrival and departure processes in the following manner. 



D, = J2 P^Ta = k] 



k=l 



fe=i 1=1 ^-^^ 



^ Entire: (:)<«")' 



1=1 



(a) 



J~ . N-1 -z 



, (58) 



where in the above, we use = C,n+C,x{'^—i) to incorporate 
the actual parameter of the memoryless service time, and in 
(a), we use j^^^ = E.>o (D^''""' < < 1. For 
each i,i G {!,..., to}, the entry of the probability 

transition matrix P„ for the imbedded Markov chain that tracks 
the number of customers just after an arrival can be computed 
by 

00 

(P.),,, = a[2 ~ j] D^) + a[3 - 1] A+i-i 

k—i 

+ CT[j-TO + l]A-j. (59) 

Note that in ( [59] ). we set Dk — when fc < 0. The distribution 
TT can then be solved from the eigenvector relation 7r(/— Pn-) = 
0. Note that a packet arriving at the first node will not be 
accepted if the node is in full buffer and no packet had left 
in the preceding inter-arrival duration. The probability of this 
blocking event at the first node is given by 



P(.g'", TOj, C,N,q) = t^tuDq- 



(60) 



Finally, we can identify the distribution of X by condi- 
tioning on the number of customers M just after a customer 
arrival. It is seen that for i, fc > 0, 



Pr[the queue is emptied at time j] 



Pt[X = i\M = fc] = ^ 

1=1 ^ / 1=1 



N-1 



xvi 



N-1 



-k 



1=1 

N-1 

1=1 



Cat \ ^ 

w J=l 



iCiCNy 



(61) 



From dMT l, we notice that the distribution of X conditioned 
on M = fc is a weighted sum of geometric distributions. The 
distribution of X can then be computed as follows. 



EZo ^k PT[X>l\M = k] 
7rfe(0Civ)'' 



E 

1=1 



Proof: Let Go be the set of all vectors b G F*^ such that 
12j=i ^i'^i = 0- Then Go forms a commutative group under 
componentwise addition. Similarly, let for each u e ,5^{U), 
let Gu be the set of vectors b e F'^ such that X]^=i ^i""* = ^■ 
It is follows that {G„ : u G ^(C/)} = F'^/Go, i.e., they are 
the coset translates of the subgroup Gq. Therefore, uniform 
selection of the coefficients to perform a linear combination 
results in the selection of a vector in ,S^{U) uniformly at 
random. Notice that dim(^(y)) = dim(^(y)) if and only 
if Y!j=i a^Wj e ^(C/) n y{V). Note that the occurrence of 
this event is improbable for large fields, since 



Pi =Pi 



E 



pi 

fc6{0,...,m} 
1={1,...,N-1} 



(1 - CiCn^ 



0<k<m 



(1 - CiCnY 



Pr [dim(^(V"')) = dim(j^^(V"))+l] = 1- 



dim{J^{U)n^{V)) 



^dim(j^(C/)) 



Also, we notice that the distribution of inter-arrival times g™' 
as seen by the second node is either an instance of /'^®(G(Cw) 
or that of (G{Cn), and hence can be written as 

^out A /„ fX 



Corollary 1: Let A be a A; x n matrix with entries from Fg 
such that rank(A) = r. Let b G F^ be selected uniformly at 
random. Then, 

Prf^b'^ 







(67) 
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{af"" + (1 - a)J) ®G(Cjv) 



(62) 



=T(g'n,m,Clv,<?) 



for some a G [0, 1]. The last step in constructing the inter- 
departure distribution is to identify a. This is done by noticing 
the mean duration between departures. Over a large duration 
N , the number of packets that are accepted at the first node 
is given by -^^(1 ~ 7^(.9'"j ™ii Cwi 'Z)) + o{N). The number 
of packets that are accepted by the second node is given by 
-(1 — q) + o{N). Since the system has finite buffer 



N 



size and no loss, the rates must match. Therefore, one can 
identify a using the following. 

1 . 1 



l-CiN 



(9'") 



Finally, notice that if /i A, we have 

1 - A 



(G(A) (G(Ai) = 



/i — A 

Using the above we can see that 



A — /i 



The basic idea of the proof is to construct a chain for the 
setting without feedback that is similar to the EMC. Once the 
chain is identified, the proof will be completed by showing that 
the transition probabilities of each transition approaches that 
of the EMC as the field size is made large. To this end, allow 
AIi{l) to be the packet received by the node Vi at the epoch. 
Set Mi{l) = if the {vi-i,Vi) channel erases the transmitted 
packet at the /* epoch. For the sake of proof, each epoch is 
divided into h sub-epochs. Since the network is assumed to 
work in a transmit-first mode, the network updates the buffers 
in a reverse-hop fashion, i.e., at the sub-epoch, the message 
generated by Vh^j is used to update that of Vh-j+i- Define 
Bi{l,j) = {Pi^kii, j) : fc = 1, . . . , JTii} to be the set of packets 
in the buffer of Vi after the sub-epoch of the epoch. 
For notational ease, let Wi{l,j) = span! IJ Bii{l,j)] for 

^h>i'>i ' 

i G {1, . . . ,h} and j G {1, . . . , h}. Note that the system is 
(G ( A) . (64) uniquely described by the dynamics of the nested vector spaces 
{yVi{l , h)} Define occupancy for this coded setting as 



(l-P(5'",m„Civ,g)). (63) 
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^-1 «7 



JV-l 

y — 

which is also a weighted sum of geometric distributions. 



G(Cw), (65) 



Appendix G 
Proof of Theorem[5] 

We present below a fundamental result that will be used in 
various stages of the proof. 

Lemma 2: Let X be a vector space over a finite field Fg and 
let y{A) = span(yl) for any A C X . Let U = {ui, . . . ,Uk} 
and V — {vi, . . . ,Vk'} be two subsets such that ^{V) C 
J^{UL)V). Then, let a be selected uniformly at random from 



F^' and set V = VU {T,j=i "^"^J'' '^^^^^ 



Pr 



<- 



ti(^(!7)n^(y)) 

jdim(J^(C/)) ■ 



(66) 



77,(Z,j)=dim(>V,(/,j-))-dini(>V,+iG,j)), \ (68) 

Notice that this notion of occupancy denotes the number of 
additional innovative packets that is housed by Vi at the 
epoch that has not been conveyed to downstream nodes. Also 
note that at the sub-epoch of the epoch, the only buffer 
that changes is that of Vh-j+i due to the receipt of Mh-j{l)- 
Thus, 

= ws.j -l),l<i<Ki^h-j + l 
VtilJ) = - 1): l<i<h,i^h-j,h-j + l 

To investigate the change of occupanc}0 after the sub- 
epoch of the epoch for nodes Vh-j and v^-j+i, we have 
to consider the following cases. 

'in accordance with the notation of j2) and (7), occupancies ni(l + 1) and 
"iC + 1) coiTespond to r]i(l, h) in this setup. 
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1. j = 1: If rjh-i{l — l,h) > 0, then by Lemma|2l we see 
that 

dim{Whil, 1)) = dim(span(W,,(; -l,h)U {MhH)})) 
= dim(W,,(; - l,h)) + 1 

with probabiUty at least 1 — ^- Therefore, it follows that 
7jh-iil,l) = rih-i{l-l,h)-l with high probabilit>0 If 
on the other hand r/;i_i(Z — 1, h) = 0, then 1) = 

and dim(W,,(/, 1)) dim{Whil - 1, h)). 

2. j > 1, Xh-j+iH) 0: In this case, Mh~j+i{l - 1) 
and there is no update at the buffers of the node Vh^j+i- 
Therefore, 

3. j > 1, Xh-j+i{l) = 1, r]h-j+i{l,j - 1) < ruh-j+i and 
rjh-j{l,j " 1) > 0: Notice that in this case, since the 
occupancy of Vh-j+i before the update is not full, there 
exists a G F"''"^+' \ {0} such that 



Vh-j{l,J~l) ■ (69) 
Vh-j+i{l,j ~ 1) 



E 

fe=i 



akPh- 



kil,J'l) e Wh-,+2{lj -^)- (70) 



(=w„_,+2(;,j)) 



Suppose that b G F<j is used to update 

Bh-j+i{l,j ^ 1) with the message Mh^j+i{l)- Then, 



J2 



fc=i 



+ ^kakPh-o+iA^.3 - 1) 1 e ^h-j+i{l.j) 



^ {^kakbk)Mh-j+i{l) e Wh-j+iilJ) 

Now, note that the vector a is not unique and that the 
set of all vectors that relate the contents of Vh-j+i 
form a vector space over of dimension rrih-j+i — 
Vh-j+i{l,j — !)■ Let ^ be the matrix generated by 
enlisting all the vectors in this space as rows. Then 
rank{A) = mh-j+i — r]h-j+i{l,j — 1) and by Cor. [l] 

PrMb^ = 0] — -i rr-r^. Therefore, by 

choosing a large field size, the probability of the event 
Mh-j+i{l) € Wh-j+i{l, j) can be made arbitrarily 
close to unity. Finally from Lemma |2] we see that 
if rih-j{l,j — 1) > 0, then Af;i_j_|_i(/) is innovative 
w.h.p. It is straightforward to see that in this setting, if 
r]h-j{l,j — 1) > 0, an innovative packet is conveyed 
w.h.p. from v^-j to Vh-j+i and 



rih-j+i{l,j) 
Vh-j{l,j) 



Vh-j+iilJ - 1) + 1 
Vh-j{lJ - 1) - 1 



(71) 



Finally, if — 1) = 0, we notice that both 

occupancies remain unaltered w.h.p. 

'"Throughout this section, by 'with high probabiHty' we mean that we can 
guarantee any probabiHty close to unity by choosing a large field size q. 



4. j > 1, Xh-j+ijl) ^ 1, ilh~j+i{l,j - 1) = rrih-j+i and 
Vh-j{l,j — 1) > 0: Suppose that by updating the buffers 
of Vh-j+i with Mfi-j+i{l) (using a randomly selected 
b), we introduce a linear dependency in the newly formed 
buffer enti-ies. That is, 3a e F™''"^+' \ {0} such that 

^kakPh-j+i,k{l,j) e y^h-j+2{l,jl 
{'Ekakbk)Mh-j+i{l) 



+ T.kakPh-j+i,kil,j - 1) 
. ■' 

e>Vft_,+i(;,j-i) 



" v ' 

cwft_,+i(;,j-i) 



^ {j:kakbk)Mh-j+iil) e Wh-j+iilJ - 1) 

^ {^kakbk = 0) V (Mh-j+iil) e Wh-j+i{l,J - 1)) 

However, from Lemma |2] and Cor [T] Pr[AIfi-j+i{l) G 
Wh-j+iHj - 1)] < 0(i) and PrE,a,6fe - 0] = i. 
Therefore, w.h.p. there is no linear dependency introduced 
after update and the occupancy is unaltered in this case. 
5. j > 1, Xh-j+i{l) = 1, r]h-j+i{l,j - 1) ^ m/t-j+i and 
Vh-j{l,j — 1) = 0: Just like before, the aim here is to 
show that there will be no change in occupancy. Since 
in this case, Vh-j has no innovative packets, the message 
it generates will be a linear combination of packets in 
Vk, k > h — j. Therefore, we can write Mii_j^i{l) — 
Y,f^ ekPh~j+i,k{^J-^)+W, where G Wh-j+2il,j- 
1) and e G F"''"^+'. Let b G f"''"^+' be used to update 
the buffer of Vh-j+i and let a G F™''~^+\ then. 



T,kakPh~j+i,kil,j) e Wh~j+2il,j) 

^kCLkPh-j + l.kilJ - 1) 

^ 1 
+ (Sfeafe6fe)Af,,_j+i(0 



e Wh-j+2(l,3) 



Note that the above is true if only if afc + efe(^; aibi) = 
for 1 < fc < m/i_j+i, since r]h-j+i{l,j — 1) = 
m/i_ Therefore, a linear dependency of stored pack- 
ets arises if and only if there is a non-trivial solution 
for (1 + e-^b)x = 0, where e = [ei, . . . , e„i^_^^J, and 
b = [bi, . . . , bm^_ _|_J. However, this occurs if and only 
if det (/ -I- e-^b) 7^ 0. Finally, note that this determinant 
is zero if and only if be^ = —1. However, this event 
occurs with probability O(^), since the vector b is chosen 
uniformly at random from F™''''^\ Therefore, w.h.p. 
there is no linear dependency induced in the contents of 
Vh-j+i and the occupancy of Vh-j+i remains rrih^j+i- 
To summarize, 

a. Although the dynamics of the system are driven by the 
spaces {Wi{lT the transitions and their probabil- 
ities depend only on {"qiil, j)}i>o, i = — 1, 
i = 1, . . . ,h, and not the spaces as such. Therefore, 
the system can be equivalently modeled using just these 
occupancy vectors as states. 

b. The transition probabilities for the chain given by occu- 
pancies {rii{l, h)}i>o, i ^ 1, . . . , h — 1 approach that of 
the EMC as the field size is made large. 
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Finally, since the steady-state probability is a continuous 
function of the probabiUty transition matrix, the steady-state 
probabilities of the chain for networks without feedback 
approaches that of the EMC, thereby guaranteeing that the 
throughput achieved by the random coding scheme over a line 
network without feedback is asymptotically the same as that of 
a line network with identical parameters and perfect feedback. 
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